
Conversation

@beicause (Contributor) commented Nov 14, 2025

Objective

The current OIT implementation stores a viewport-sized fragment buffer per layer, which uses far more memory than it needs to.

Solution

Implements a per-pixel linked list for OIT, which saves memory and can handle more layers. The implementation is based on https://github.com/KhronosGroup/Vulkan-Samples/tree/main/samples/api/oit_linked_lists
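For context, a minimal sketch of what the store pass of a per-pixel linked list typically looks like is shown below. This is illustrative only: the struct fields, binding layout, and names (`heads`, `fragments`, `counter`, `store_oit_fragment`) are assumptions made for the sketch, not the PR's actual shader code.

```wgsl
// Illustrative only; not the PR's actual bindings or layout.
struct OitFragment {
    color: u32,  // packed RGBA, e.g. via pack4x8unorm
    depth: f32,
    next: u32,   // index of the next fragment in this pixel's list, or INVALID
}

// One list head per screen pixel, plus one shared fragment pool and a counter.
@group(0) @binding(0) var<storage, read_write> heads: array<atomic<u32>>;
@group(0) @binding(1) var<storage, read_write> fragments: array<OitFragment>;
@group(0) @binding(2) var<storage, read_write> counter: atomic<u32>;

const INVALID: u32 = 0xffffffffu;

// Called from the transparent-pass fragment shader for each transparent fragment.
fn store_oit_fragment(pixel_index: u32, color: u32, depth: f32) {
    // Reserve a slot in the shared pool.
    let slot = atomicAdd(&counter, 1u);
    if slot >= arrayLength(&fragments) {
        return; // pool exhausted: drop the fragment
    }
    // Push the new fragment onto the front of this pixel's list.
    let prev_head = atomicExchange(&heads[pixel_index], slot);
    fragments[slot] = OitFragment(color, depth, prev_head);
}
```

The memory saving comes from the fragment pool being a single global budget shared by all pixels, rather than a viewport-sized allocation per layer.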

Testing

Tested with the order_independent_transparency example; I also added a new scene to it.

Details: screenshot attachment 屏幕截图_20251114_100337

@IceSentry self-assigned this Nov 14, 2025
@IceSentry self-requested a review Nov 14, 2025
@IceSentry removed their assignment Nov 14, 2025
@IceSentry added the C-Feature (A new feature, making something new possible) and A-Rendering (Drawing game state to the screen) labels Nov 14, 2025
@IceSentry added the S-Needs-Review (Needs reviewer attention (from anyone!) to move forward) and D-Shaders (This code uses GPU shader languages) labels Nov 14, 2025
@IceSentry (Contributor) left a comment


This is awesome. Thank you so much for working on this. Sorry it took so long for me to review; I got sick the same week you opened the PR and haven't had time to come back to it since.

This is very close to what I had in mind as a follow-up to my original OIT impl, so I'm really happy to see it in action.

I managed to review the PR because I'm very familiar with OIT, but to make the diff simpler to follow I would suggest adding depth prepass support to the current OIT impl in a separate PR. That way the linked-list changes won't be mixed with the depth prepass changes.

Add `reserve_internal` to `BufferVec`
Add `capacity`, `set_label`, and `get_label` to `UninitBufferVec`
Use `Vec::reserve` to reduce allocations
@goodartistscopy (Contributor) commented Jan 11, 2026

This is in pretty good shape, I believe. If you're not in a hurry to merge, I'd like to try a possible improvement where, instead of pulling the fragments into an array first at resolve time, we would iterate over the linked list (N times) and pop the closest fragment each time (sketched after this comment). That might sound bad at first because it's O(N²) accesses to a storage buffer (non-contiguous, at that).
However:

  • After the first pass, the fragments might reside in cache, making accesses not so bad (to be validated, of course).
  • It eliminates the large local array, which might make the compiler allocate a ton of registers (hurting occupancy) or even spill to VRAM, which is not good either.
  • If that works, the number of layers per pixel is truly unlimited; the only limit is the global budget of the allocated fragment buffer.

What do you think?
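To make the suggestion concrete, here is a rough sketch of such a selection-style resolve loop. It is purely illustrative: it reuses the assumed `heads`/`fragments`/`INVALID` names from the store sketch above, `blend_under` is a placeholder front-to-back compositing function, and the depth comparisons assume smaller depth means closer (flip them for reverse-Z).

```wgsl
// Reuses the assumed layout from the store sketch above:
//   heads:     array<atomic<u32>>  (one list head per pixel)
//   fragments: array<OitFragment>  (.color packed u32, .depth f32, .next u32)
//   INVALID:   0xffffffffu         (end-of-list marker)

// Front-to-back "under" compositing (assumes premultiplied alpha).
fn blend_under(dst: vec4<f32>, src: vec4<f32>) -> vec4<f32> {
    return dst + (1.0 - dst.a) * src;
}

// Selection-style resolve: one list walk per blended fragment, no local array.
// Fragments with exactly equal depth are collapsed in this simplified version.
fn resolve_oit(pixel_index: u32) -> vec4<f32> {
    var result = vec4<f32>(0.0);
    var last_depth = -1.0; // depth of the fragment blended on the previous pass

    loop {
        // Find the closest fragment strictly behind the last one blended
        // (smaller depth = closer here; flip comparisons for reverse-Z).
        var best = INVALID;
        var best_depth = 3.40282e38; // ~f32 max
        var node = atomicLoad(&heads[pixel_index]);
        while node != INVALID {
            let d = fragments[node].depth;
            if d > last_depth && d < best_depth {
                best_depth = d;
                best = node;
            }
            node = fragments[node].next;
        }
        if best == INVALID {
            break; // all fragments blended
        }
        result = blend_under(result, unpack4x8unorm(fragments[best].color));
        last_depth = best_depth;
    }
    return result;
}
```

Whether the repeated, non-contiguous list walks stay cheap enough in practice is exactly the "to be validated" point above.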

@beicause (Contributor, Author) commented Jan 12, 2026

From what I see in the backwards memory allocation and register-based block sort papers (they are complex and not worth it, since games typically have fewer transparency layers), I think there is reason to believe that sorting in place in the SSBO would be slower.

